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STRUCTURE FOR REDUCING LEAKAGE CURRENT IN SUBMICRON IC 

DEVICES 

FIELD OF THE INVENTION 

[0001] The present invention relates to reducing power 
consumption in integrated circuit chips and more 
particularly to reducing power consumption by reducing 
leakage current in submicron IC devices, particularly deep 
submicron CMOS IC devices . 

BACKGROUND 

[0002] It is expected that future electronic systems will 
continue to require ever increasing speed and that they 
consume as little power as possible. To attain higher 
speeds and maintain low power consumption, integrated 
circuit (IC) chips, as one of the principle components of 
electronic systems, will need to operate at ever higher 
frequencies while consuming as little power as possible. 
For IC chips manufactured using CMOS technology, as device 
sizes shrink to increase speed and reduce chip size, 
increased static power consumption will become a major 
hurdle to attaining low power consumption goals. Static 
power is consumed by circuits, and, individual devices that 
are not actively changing states, i.e., the transistors are 
in a steady off state. Up until now, static power 
consumption in CMOS technology has been negligible. But the 
continued shrinking of device sizes will change this. 
[0003] Continuing process advancements have allowed for 
reductions in critical dimensions in CMOS manufacturing. IC 
device dimensions have now reached or are about to reach a 
critical point where static power consumption will become a 
major concern unless new techniques are implemented to avoid 
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unacceptable static power consumption levels. As device 
sizes have shrunk there has been a reduction in power supply 
voltage (V dd ) . While lower V dd corresponds to lower dynamic 
power consumption, it also reduces the speed of the device . 
To maintain or increase device speed, efforts have been 
pursued to reduce the threshold voltage (V t h) of transistors 
within a given process. However, the subthreshold voltage 
current, or leakage current, of a transistor exponentially 
increases with any V t h decrease. At prior larger device 
dimensions, this exponential leakage current increase was 
still negligible. But, at current and future device 
dimension sizes , this exponential increase in leakage 
current will result in a rapid and noticeable increase in 
static power consumption. Thus, without employing a new 
approach, the designer may be required to make unacceptable 
trade off decisions between speed and power consumption. 
[0004] To counteract this increasing power consumption 
problem, it would be possible to increase the voltage 
threshold level of the transistors, however, this would have 
negative impacts on the transistor speed or frequency at 
which the device could be used. Furthermore, increasing Vth 
can introduce other problems because of noise margins that 
must be maintained within the device. It has been found that 
increasing V t h to more than V d d/3 will negatively impact the 
functionality of the device. 

[0005] As device sizes continue to shrink, this static 
power consumption issue will become more important for the 
entire semiconductor industry. This is particularly 
important now for makers of chips such as Field Programmable 
Devices (FPDs) with a large number of transistors on a 
single die, of which the majority will remain in a static 
off state. 
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[0006] An FPD is a programmable logic device comprised of 
an array of Configurable Logic Elements (CLEs) surrounded by 
a General Routing Matrix (GRM) with a periphery of 
input /output ports. In general, FPDs include programming 
elements such as static random access memory cells (SRAMs) , 
antifuses, EPROMs, Flash cells, or EE PROMS . These memory 
elements are used to control the functions performed by the 
CLEs, the routing of signals in the GRM between CLEs, and 
the functionality of the input/output ports. Recently, FPD 
makers have trended toward providing a large number of 
drivers or buffers to support high fan out signals to be 
routed in the GRM. An FPD is designed to perform any logic 
function required by a user. 

[0007] In practice, once a FPD user designs the function 
to be implemented by the FPD, and the FPD is programmed to 
perform the function, a large number of the resources 
available on the FPD are unused. Thus, the FPD may have a 
large percentage of transistors that are not being used at 
any given time. Xilinx, a leading manufacturer of FPDs, 
makes a variety of FPD known as a field programmable gate 
array (FPGA) . Analysis of typical designs used by users of 
Xilinx FPGAs shows that anywhere from 60 to 90 percent of 
the FPGA resources are typically unused. These unused 
resources are in a static mode and thus as static power 
consumption increases for a given process the FPGA or FPD is 
likely to see large increases in overall power consumption. 
[0008] It is desirable then to implement new circuit 
techniques that will operate at increased speeds and reduce 
leakage current in CMOS devices and thereby reduce IC chip 
power consumption. 
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SUMMARY OF THE INVENTION 

[0009] The present invention solves the above-described 
problems and provides a distinct advance in the art of CMOS 
IC design. More particularly, the present invention 
provides a technique for reducing leakage current in static 
CMOS devices by adding additional transistors in series 
between selected inverters or logic gates and ground or 
power . 

[0010] In a first separate aspect of the invention, NMOS 
and PMOS transistors are added to selected buffers comprised 
of two inverters in series. The PMOS transistor is connected 
between the first inverter and power supply and the NMOS 
transistor is connected between the second inverter and 
ground. The added transistors are controlled by a memory 
cell to be on when the buffer is used and off when the 
buffer is unused. 

[0011] In a second separate aspect of the invention an 
NMOS transistor is added to selected buffers comprised of 
two inverters in series. The NMOS transistor is connected 
between the second inverter of a selected buffer and ground 
and controlled by a memory cell to be on when the buffer is 
used and off when the buffer is unused. A PMOS transistor 
of the first inverter is manufactured to sit in a V gg well. 
[0012] In a third separate aspect of the invention an 
NMOS transistor is added to selected buffer pairs each of 
which is comprised of two inverters in series. A single 
NMOS transistor is connected between the second inverter of 
each selected buffer and ground and controlled by a memory 
cell to be on when one or both of the buffers are used and 
off when both of the buffers are unused. A PMOS transistor 
of the first inverter in each buffer is manufactured to sit 
in a Vgg well . 
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[0013] In a fourth separate aspect of the invention each 
of the above recited techniques are employed in drivers used 
to drive signals onto high capacitance interconnect lines 
where the drivers comprise buffers made of logic gates in 
series with an inverter. 

[0014] In a fifth separate aspect of the invention each 
of the above recited techniques are employed in the 
circuitry of a field programmable gate array. 

[0015] These and other important aspects of the present 
invention are described more fully in the detailed 
description below. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0016] A preferred embodiment of the present invention is 
described in detail below with reference to the attached 
drawing figures, wherein: 

[0017] Fig. 1 is a high-level block diagram depicting an 
FPGA with which the present invention may be used; 
[0018] Fig. 2 is a more detailed block diagram showing 
one tile of Fig. 1; 

[0019] Fig. 3 is a schematic diagram of a portion of the 
OMUX of Fig, 2; 

[0020] Figs. 4A and 4B show schematic diagrams of one 
embodiment of the stacking technique of the invention; 
[0021] Fig. 5 is a schematic diagram of one embodiment of 
the stacking technique of the invention; 

[0022] Fig. 6 is a schematic diagram of one embodiment of 
the stacking technique of the invention; 

[0023] Fig. 7 is a schematic diagram of a vertical long 
line driver utilizing one embodiment of the stacking 
techniques of the present invention; 

[0024] Fig. 7a shows detail of region 171 of Fig 7; and 
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[0025] Fig. 8 is a schematic diagram of a horizontal long 
line driver utilizing one embodiment of the stacking 
techniques of the present invention. 

[0026] The drawing figures do not limit the present 
invention to the specific embodiments disclosed and 
described herein. 

DETAILED DESCRIPTION 

[0027] Although the circuit techniques described herein 
are not limited to use in FPDs, each embodiment of the 
invention will be described with reference to a particular 
FPGA manufactured by Xilinx, Inc., commercially known as the 
Virtex® FPGA. (Virtex is a registered trademark of Xilinx, 
Inc., owner of the present patent application.) Turning to 
the drawing figures, and particularly to Fig. 1, one FPGA, 
which may be constructed in accordance with an embodiment of 
the invention, is illustrated. 

[0028] Fig. 1 is a block diagram depicting the general 
architecture of an FPGA. Fig. 1 illustrates part of an 
array of tiles in an FPGA. Each tile includes a logic block 
L and a routing or switching structure R. Also in each tile 
are line segments which connect to line segments in adjacent 
tiles to form interconnect lines. The CLEs mentioned above 
are contained in the logic blocks L and the interconnect 
lines and switching structures R combine to form the GRM. 
Most of the interconnect lines in the FPGA are not shown in 
Fig. 1 so as not to obscure the figure. Three of the tiles, 
labeled TILE 1, TILE 2, and TILE 3 are shown in more detail. 
Short and intermediate length interconnect lines are shown 
extending between TILE 1, TILE 2, TILE 3, and their 
immediate neighbors. The heavy black lines labeled 6VM, 
6VN, and 6VS represent interconnect lines running generally 
in a north-south direction spanning multiple tiles. There 
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are similar interconnect lines running horizontally in the 
device but not shown. The single length lines of TILE 2 
labeled N, S, E, and W represent multiple lines connecting 
between adjacent tiles and connecting to the logic block L 
of TILE 2. The details of the interconnection architecture 
such as the numbers of conductive lines and their length are 
not important to the invention and will only be discussed as 
necessary with respect to later figures. The logic blocks L 
represent logic function generators. In the FPGA of Fig. 1 
these logic blocks comprise a plurality of lookup tables. 
[0029] Included in the FPGA of Fig. 1, but not shown, is 
an array of memory cells that are used to control the 
functionality of the logic blocks L as well as the 
connection of the interconnect lines to each other. These 
memory cells may be SRAM cells, Flash cells, EPROMs, 
EEPROMs , or anti-fuses, but are SRAM cells in the preferred 
FPGA. In general, the interconnect lines are connectable to 
each other and the single length lines may also be connected 
to the logic block L. At the periphery of the tile array 
are input/output blocks containing input/output ports, which 
are not shown on Fig. 1. The functionality of the 
input/output ports is also controlled by memory cells. 
[0030] Turning to Fig. 2, the details of a single tile 
are depicted at a block diagram level. The tile includes 
vertical and horizontal lines, each representing a plurality 
of conductive traces on which logic data signals may be 
routed. In general, there are three varieties of conductive 
lines that enter and exit at the periphery of the tile. 
These varieties include long lines (LVN, LVS, LHE, LHW) , 
intermediate lines (VN, VS, HE, HW) , and direct connect 
lines (N, S, E, W) . The long lines extend for the entire 
length or width of the array, or half of the length or width 
of the array in a second embodiment. The intermediate lines 
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extend for multiple lengths of tiles (six or two tiles in 
this embodiment) . The direct connect lines connect between 
adjacent tiles. 

[0031] There are four major functional blocks depicted in 
Fig. 2. The first is a region labeled NON-LOCAL 

INTERCONNECT. Within this region each of the direct connect 
lines N, S, E, W may be connected to each other, some of the 
intermediate length lines 40, 42 can be connected to each 
other, some of the intermediate length lines 40, 42 can be 
connected to the direct connect lines, and some of the long 
lines 44, 46 can be connected to intermediate length lines. 
A detailed description of a similar interconnection scheme 
may be found in U.S. Patent 5,914,616 entitled W FPGA 
Repeatable Interconnect Structure with Hierarchical 
Interconnect Lines", which is hereby incorporated by 
reference. The connections that will be made are determined 
by the programming of memory cells that control NMOS pass 
transistors . 

[0032] The second major functional block is labeled IMUX. 
The IMUX region comprises multiplexers that determine which 
signals will be input to the CLE. The multiplexers are 
comprised of buffered NMOS pass transistors controlled by 
memory cells. Each of the direct-connect lines N, S, E, W 
is an input to the IMUX region. In addition, some of the 
intermediate length lines 50 and some of the CLE output 
lines 52 are connected into this region. 

[0033] The third major functional block is labeled CLE. 
Within this region are logic function generators that, in 
one embodiment, comprise lookup tables (LUT) . The LUTs 
comprise memory cells that are programmed to perform the 
user's logic function. 

[0034] The fourth major functional block is labeled OMUX. 
The OMUX region comprises multiplexers that select which of 
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the CLE outputs will be connected back into the NON-LOCAL 
INTERCONNECT region to be distributed to other tiles or 
input/output ports. The tile of Fig. 2 is further described 
in U.S. Patent 5,914,616. 

[0035] Turning to Fig. 3, a schematic diagram of one of 
the multiplexers from the OMUX region is depicted. On the 
left of the diagram are 24 inputs, labeled IN0-IN23, that 
are connected on the right to a pass transistor selection 
matrix 100. The pass transistor selection matrix is 
comprised of two stages. At the first stage, four of the 24 
inputs are selected to pass through to the second stage 
where one of the four inputs is passed to the output. In 
the first stage, the 24 inputs are grouped into four groups 
of six. For example, IN16-IN23 make up the top group. A 
pass transistor is connected to each input line on one side 
and to a first stage common node 102 on the other side. 
Thus, there are four first stage common nodes 102, and each 
first stage common node is an input to a second stage pass 
transistor. The second stage pass transistors each have one 
side connected to one of the first stage common nodes and 
the second side connected to a second stage common node 104. 
There is one second stage pass transistor associated with 
each of the four groups. The second stage common node 104 
is input to a buffer 106 comprised of two inverters 
connected in series. The output 108 of the buffer becomes 
one of the outputs from the OMUX region. In the example 
FPGA there are 16 such multiplexers in the OMUX region. 
Across the top of the selection matrix are memory cells 110 
with outputs connected to the control gates of the pass 
transistors. The memory cells are powered by a Vgg voltage, 
which is 10-15% higher than the supply voltage Vdd. This 
minimizes voltage drop across the NMOS pass transistors when 
passing logic 1. From left to right, the first six memory 
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cells each control four separate pass transistors, one from 
each of the four groups in the first stage of the selection 
matrix 100. The remaining four memory cells control a 
single pass transistor in the second stage of the selection 
matrix. The por_b signal is a reset signal, and goes low 
only at power-up, to cause all output signals to be logic 1. 
The transistor controlled by por_b is very small, and its 
static power leakage is therefore negligible. 
[0036] In operation, software ensures that only one of 
the first stage memory cells and one of the second stage 
memory cells are programmed with a logic "1" so that only 
one path is conducting and there is no contention. For 
example, IN23 will be passed to the second common node 104 
when the first and seventh memory cells are programmed with 
a logic " 1". The remaining memory cells will each be 
programmed with a logic "0". With the appropriate 
programming, any one of the 24 inputs can be selected to 
drive the OMUX output. 

[0037] Figs. 4A and 4B show one embodiment of a circuit 
design technique referred to as transistor stacking. Fig. 
4A shows the detailed circuit 106 implementation including 
the two inverters of Fig. 3. The control gate 12 0 of PMOS 
transistor 116 is connected to the first inverter output 
122. PMOS transistor 116 acts as a pull-up transistor 
speeding signal propagation when a weak high voltage signal 
is present on inverter input 118. The output of the first 
inverter 122 is connected to the input of the second 
inverter, which is a standard CMOS inverter. These 
inverters operate together to form a buffer, as is well 
known to one of ordinary skill in the art. In order to 
maximize switching speed while not excessively increasing 
leakage, only one transistor 114 in buffer 106 is made to be 
a low threshold transistor. (The triangle symbol in 
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transistor 114 indicates that it is a low threshold 
transistor.) Transistor 114 is chosen because it is part of 
the first inverter stage of the buffer, and therefore of 
smaller size than those of the second stage. Also, the NMOS 
transistor 114 rather than the PMOS transistor 112 is chosen 
to be low threshold because high switching speed is 
important in the presence of a rising input signal at node 
118. Note that transistor 114 will be on when buffer 106 is 
not used, and because there is no voltage drop across 
transistor 114 when it is on, this low threshold transistor 
will not consume static power. 

[0038] Fig. 4B depicts the same circuit but with two 
additional transistors and a memory cell added. For ease of 
reference, like structures are given identical numbers. The 
first inverter has an additional PMOS transistor 124 
connected between PMOS transistor 112 and V dd . Transistor 
124 is referred to as a virtual V dd transistor. The second 
inverter has an additional NMOS transistor 126 connected 
between NMOS transistor 12 8 and ground. This additional 
NMOS transistor is referred to as a virtual ground 
transistor. The control gates of virtual V dd transistor 124 
and virtual ground transistor 126 are connected to the 
complemented and true output of a memory cell 130 
respectively. Depending on the requirements of the FPGA 
user's design, any given switch and its corresponding buffer 
in the OMUX region may or may not be used. If the 
particular buffer is used, memory cell 13 0 will be 
programmed to turn both virtual transistors on, and the 
first inverter will be connected to power and the second 
inverter will be connected to ground. In this state, the 
buffer will act essentially as if the virtual power and 
ground transistors did not exist. If the buffer is unused, 
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memory cell 13 0 will be programmed to turn the virtual 
transistors off. 

[0039] The addition of the stacked virtual transistors 
significantly reduces the leakage current associated with 
the buffer. When the switch or buffer is not used, 
depending on device sizes , leakage current will be reduced 
by a factor of 10-20. This can be shown mathematically using 
the sub-threshold current formula in BSIM 3.3 spice models. 
Simulation of this stacking concept in a 0.15 micron process 
and in a 0.18 micron process confirm these results. Similar 
results also show that stacking will reduce the sub- 
threshold current even further in dimension sizes smaller 
than the 0.15 micron process technology. Therefore, as the 
process technology shrinks, utilizing the stacking concept 
becomes more advantageous . 

[0040] Addition of the virtual power and ground 
transistors comes with the penalty of increased area 
required for each buffer. Fig. 5 shows an alternative 
embodiment in which one of the virtual transistors 124 from 
Fig. 4, virtual V dd , is eliminated. In this embodiment, no 
virtual V dd transistor is added, but, the PMOS transistor 112 
of the first inverter should be manufactured to sit in a V gg 
well, indicated by region 132. V gg refers to a voltage which 
is typically 10-15% higher than normal power, V dd . 
Simulation results in a 0.15 micron process technology show 
that manufacturing the PMOS transistor 112 to sit in a V gg 
well reduces its leakage current by a factor of more than 2 . 
This is mainly due to the dependency of threshold voltage on 
V bS / the voltage between the base or substrate well and the 
source terminal of the transistor. When the source of a 
PMOS transistor is connected to V dd and the base or well is 
connected to V gg , V bs will be equal to V gg -V dd , causing a 
reduction in the leakage current. 



12 



X-838 



PATENT 



[0041] Fig. 6 shows yet another alternative embodiment, 
which, in addition to eliminating the virtual V d a transistors 
by manufacturing the inverter PMOS transistors in a V gg well, 
groups OMUX output buffers into pairs so that each buffer 
pair shares a common virtual ground transistor. In this 
figure, the first inverter stage of each buffer 150 (now 
schematically shown by the common logic symbol) is 
manufactured identically as described with respect to Fig. 
5. The second stage differs in that only one virtual ground 
transistor 152 is added for each buffer pair and the source 
of the NMOS transistor 154 from each second stage inverter 
is connected to the drain of the virtual ground transistor 
152. A single memory cell 156 controls virtual ground 
transistor 152 for the buffer pair. This embodiment further 
reduces the area penalty incurred by stacking by cutting in 
half the number of virtual ground transistors and the number 
of memory cells that would be needed to implement the 
stacking technique of Fig. 5 for each OMUX output buffer. 
[0042] However, the Fig. 6 embodiment adds complexity to 
the "place and route" software needed to program the FPGA. 
Place and route software is part of the automated tool used 
to implement a design in an FPGA. Briefly, after a user 
determines the functionality required of an FPGA, that 
design is implemented by automated software tools that 
determine how to configure the logic blocks and connect them 
together. The place and route step in this process 
determines which logic blocks and which interconnect lines 
will be used to implement the user's logic and connect the 
logic blocks together. The stacking technique embodiment of 
Fig. 6 complicates the place and route analysis because to 
maximize the reduction in static power consumption, used and 
unused buffers in a given design should be separated into 
different groups. There is no static power consumption 
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savings if the virtual ground transistor of an unused buffer 
is turned on. Thus, to the extent possible, the place and 
route software should attempt to route pairs of active 
signals through buffer pairs that share the same virtual 
ground transistor. 

[0043] In cases where resource utilization is very low, 
the leakage savings will be significant even without a 
special routing effort by the place and route software. 
[0044] Although three different stacking technique 
embodiments are described above, there is no reason all 
three could not be designed into a single FPD depending on 
design considerations. 

[0045] Regardless of the stacking technique employed, 
after the place and route step has been completed, the 
software must determine the programming of the memory cells 
that control virtual ground and/ or virtual power 
transistors. As will be appreciated by one of ordinary 
skill, this is a straightforward procedure of identifying 
used and unused buffers and setting the virtual control 
memory cells with a "1" or a "0" as appropriate. The 
configuration data for programming virtual control memory 
cells will be included in the configuration data file for 
the device. 

[0046] Returning to Fig. 2, depicted on the left hand 
side of the figure is a vertical line labeled LVS at the 
bottom and LVN at the top. This line represents four 
vertical long lines which may run for the height of the 
array, or half of the array height in a second embodiment. 
Each row and column has such long lines. However, for 
clarity they are only depicted in this single column. 
[0047] Fig. 7 is a circuit schematic showing the 
structure utilized to route signals onto a vertical long 
line such as that shown in Fig. 2. On the left hand side of 
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the figure are ten inputs IN0-IN9 160. These inputs feed a 
two-stage pass transistor selection matrix 162 with two 
groups of five inputs in the first stage and two pass 
transistors controlling selection paths in the second stage. 
Across the top of Fig. 7 are seven memory cells 164 that 
control the selection matrix. From left to right, the first 
five memory cells control the first stage of the selection 
matrix in exactly the same manner as described with respect 
to Fig. 3. The sixth memory cell directly controls the 
lower pass transistor 166 in the second stage of the 
selection matrix in the same manner as described with 
respect to Fig. 3. The upper transistor 170 of the second 
stage is controlled by a two input NOR gate 168 that 
receives one of its inputs from the true output of the sixth 
memory cell, and its other input from the complemented 
output of the seventh memory cell . The upper pass 

transistor 17 0 will be "on" only when both inputs to NOR 
gate 168 are low. NOR gate 168 reduces leakage current by 
ensuring that top transistor 170 in the second stage is off 
when unused, and the NOR gate is smaller than a memory cell, 
thus saving space on the chip. The output of the seventh 
memory cell feeds several different structures in the 
circuit of Fig. 7. The seventh memory cell will be 
programmed with a logic "1" only if the vertical long line 
driver will be used in the user's design. If the driver is 
in use, NOR gate 168 will receive a "0" from the 
complemented output of the seventh memory cell. Thus, in 
this state NOR gate 168 will act as an inverter of the 
signal from the sixth memory cell, determining the 
conducting path in the second stage of selection matrix 162. 
[0048] The structures within the dashed region 171 
combines to operate as a buffer which drives signals onto 
the long line. NAND gate 176 from region 171 is shown in 
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greater detail in Fig. 7a. NAND gate 176 is made of two 
PMOS transistors 22 0 and two NMOS transistors 222 and 224. 
The operation of this NAND gate is readily apparent to one 
of ordinary skill in the art. As shown in Fig. 7a, the 
source of NMOS transistor 224 is connected to the drain of 
virtual ground transistor 172. Returning to Fig. 7, when 
the seventh memory cell is programmed with a logic "1", 
virtual ground transistor 172 in region 171 will be on, 
enabling the buffer. NAND gate 176 receives a logic "1" 
from the seventh memory cell and NOR gate 178 receives a 
logic "0" from the complemented output of the seventh memory 
cell. As will be appreciated by one of ordinary skill in 
the art, under these conditions both logic gates will invert 
their other input which comes from the data signal line 180 
connected to the second common node of the pass transistor 
matrix 162. The second inverter 182 of the buffer then 
inverts the data signal again, completing the buffer 
function. A feedback path is provided from the output of 
NOR gate 178. If the data signal on signal line 180 is 
high, NOR gate 178 inverts the data signal, which causes 
PMOS transistor 184 to turn on and pull data line 18 0 high. 
If the data signal on node 180 is low, NOR gate 178 will 
output a high signal, turning PMOS transistor 184 off. As 
in Fig. 3, the por_b signal goes low only at power-up to 
cause all output signals to be logic 1. The transistor 
controlled by por_b is very small, and its static power 
leakage is therefore negligible. 

[0049] Fig. 8 depicts the circuitry to drive signals onto 
horizontal long lines. Dashed region 190, comprised of NAND 
gate 192, NOR gate 194, inverter 196, memory cell 198, and 
virtual ground transistor 200, and PMOS transistor 204 
operates in exactly the same manner as described with 
respect to Fig. 7. Fig. 8 differs from Fig. 7 in that 
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selection matrix 206 has fewer inputs and includes two NOR 
gates 2 08, 210 as part of a decoding structure to select 
which signal will be passed to the buffer 190. NOR gates 
2 08 and 210 each receive one input from the third memory 
cell 198. NOR gate 208 receives its second input from the 
first memory cell. NOR gate 210 receives its second input 
from the second memory cell. If the third memory cell is 
programmed with a logic "1" then the structure of Fig. 8 
will be active. NOR gates 208 and 210 will invert the 
values programmed into the first and second memory cells 
respectively, and the programming of those memory cells will 
determine which signal is passed through selection matrix 
206. The first memory cell in conjunction with NOR gate 208 
controls the first stage of selection matrix 206. Likewise, 
the second memory cell in conjunction with NOR gate 210 
controls the second stage of selection matrix 206. If the 
third memory cell is programmed with a "0 n , the outputs of 
NOR gates 208 and 210 will be low. 

[0050] Although one stacking technique has been 
demonstrated with respect to Figs. 7 and 8, each of the 
above disclosed techniques may be applied to the structures 
of these figures as well. For instance, it would be 
possible to group long line drivers into pairs as 
demonstrated in Fig. 6 for the OMUX driver. Although 
particular drivers in the example FPGA have been shown 
utilizing the various embodiments of the present invention, 
there are other drivers in the device to which these 
techniques can be applied. Furthermore, in general, the FPD 
industry is trending towards the use of more and more 
drivers on any given device to enhance speed and fan out 
capacity within the routing matrices on the FPDs, making the 
techniques of this invention more important as device sizes 
shrink . 
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[0051] The above description of the invention is intended 
to be illustrative and not limiting. Other embodiments of 
this invention will be obvious to those skilled in the art 
in view of the above disclosure. 
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